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Abstract We prove that, unless P = NP, no polynomial algorithm 
can approximate the minimum length of synchronizing words for a given 
synchronizing automaton within a constant factor. 



Background and overview 

Let = {Q,E,6) be a complete deterministic finite automaton (DFA), 
where Q is the state set, U is the input alphabet, and 6 : Q x U ^ Q 
is the transition function. The function 6 extends in a unique way to a 
function Q x U* ^ Q , where U* stands for the free monoid over U ; the 
latter function is still denoted by 6. Thus, each word in U* acts on the 
set Q via 5. The DFA .s/ is called synchronizing if there exists a word 
w G S* whose action resets , that is to leave the automaton in one 
particular state no matter which state in Q it starts at: 6{q,w) = 6{q',w) 
for all q,q' G Q . Any such word w is called a synchronizing word for 
The minimum length of synchronizing words for =e/ is denoted by 
min^ynchl-c^)- 

Synchronizing automata serve as transparent and natural models of 
error-resistant systems in many applications (coding theory, robotics, test- 
ing of reactive systems) and also reveal interesting connections with sym- 
bolic dynamics and other parts of mathematics. For a brief introduction 
to the theory of synchronizing automata we refer the reader to the recent 
survey [H]. Here we discuss only some complexity-theoretical issues of the 
theory. In the following we assume the reader's acquaintance with some 
basics of computational complexity that may be found, e.g., in |3|6| . 

There is a polynomial algorithm (basically due to Cerny [1]) that de- 
cides whether or not a given DFA is synchronizing. In contrast, determin- 
ing the minimum length of synchronizing words for a given synchronizing 
automaton is known to be computationally hard. More precisely, deciding. 



given a synchronizing automaton and a positive integer whether or 
not vamgynchi,^) < ^ is NP-complete |2|5|9|8| . Moreover, deciding, given 
the same instance, whether or not min^y^chl-^^) = ^ is both NP-hard and 
co-NP-hard [8]. Thus, unless NP = co-NP, even non-deterministic algo- 
rithms cannot find the minimum length of synchronizing words for a given 
synchronizing automaton in polynomial time. 

There are some polynomial algorithms that, given a synchronizing au- 
tomaton, find synchronizing words for it, see |2|10|7| . Such algorithms can 
be considered as approximation algorithms for calculating the minimum 
length of synchronizing words but it seems that they have not been sys- 
tematically studied from the approximation viewpoint. Experiments show 
that Eppstein's greedy algorithm [2] behaves rather well on average and 
approximates Tcivusynchi^si) within a logarithmic factor on all tested in- 
stances; however, no theoretical justification for these observations has 
been found so far. 

In this paper we prove that, unless P = NP, no polynomial algo- 
rithm can approximate the minimum length of synchronizing words for a 
given synchronizing automaton within a constant factor. This result was 
announced in the survey [H] (with a reference to the present author's 
unpublished manuscript) but its proof appears here for the first time. We 
also mention that a special case of our result, namely, non-approximability 
of mingy^c/j(j2/) within factor 2, was announced by Gawrychowski [4]. 

The paper is organized as follows. First we exhibit an auxiliary con- 
struction that shows non-approximability of min^ynchl-s^^) within factor 

2 — e for automata with 3 input letters. Then we show how to iterate this 
construction in order to obtain the main result, again for automata with 

3 input letters. Finally, we describe how the construction can be modified 
to extend the result also to automata with only 2 input letters. 

1 Non-approximability within factor 2 — e 

First we fix our notation and introduce some definitions. When we have 
specified a DFA si = {Q, U, 6) , we can simplify the notation by writing 
q.w instead of 6{q, w) for q € Q and w G S* . For each subset S <Z Q and 
each word w £ S* , we write S.w instead of {q-ui \ q £ S} . We say that a 
subset 5 C Q is occupied after applying some word v £ S* ii S CI Q.v. 

The length of a word w £ S* is denoted by . If 1 < s < , then 
w[s] denotes the letter in the s-th position of w; similarly, if 1 < s < t < 
\w\ , then t(;[s..t] stands for the word t(;[s]t(;[s -|- 1] • ■ ■ w[t] . 



Let /C be a class of synchronizing automata We say that an algorithm 
M approximates the minimal length of synchronizing words in JC if, for 
an arbitrary DFA =2/ € /C, the algorithm calculates a positive integer 
M{£/) such that M{£/) > mmsynchi-s^) ■ The performance ratio of M at 

£/ is Rm(-si^) = ^ — r~7T • The algorithm is said to approximate the 

minimal length of synchronizing words within factor € M if 

sup{Rm{-^) \£/ gIC} = k. 

Even though the following theorem is subsumed by our main result, 
we prove it here because the proof demonstrates underlying ideas in a 
nutshell and in the same time presents a construction that serves as the 
induction basis for the proof of the main theorem. 

Theorem 1. // P 7^ NP, then for no e > a polynomial algorithm 
approximates the minimal length of synchronizing words within factor 2—e 
in the class of all synchronizing automata with 3 input letters. 

Proof. Arguing by contradiction, assume that there exist a real number 
e > and a polynomial algorithm M such that Rm{-^) < 2 — e for every 
synchronizing automaton with 3 input letters. 

We fix an arbitrary n > 2 and take an arbitrary instance il) of the clas- 
sical NP-complete problem SAT (the satisfiability problem for a system 
of clauses, that is, formulae in conjunctive normal form) with n variables. 
Let m be the number of clauses in ijj . We shall construct a synchroniz- 
ing automaton with 3 input letters and polynomial in m, n num- 
ber of states such that min^y^ch (-2/ ('(/')) = n + 2 \i ijj is satisfiable and 
mm.synchi'^ {i^)) > 2(?i — 1) if ■0 is not satisfiable. If n is large enough, 
namely, n > f — 2 , then we can decide whether or not tp is satisfiable by 
running the algorithm M on ^ (-0) • Indeed, if V is not satisfiable, then 
M{£/{ip)) > mmsynchi-^ (fp)) > 2(?i — 1), but, if ip is satisfiable, then 

M{£/{4;)) <{2-£) min,ynch{^{tP)) = (2 - e)(n + 2) 

6 

^(2-^)(n + 2) = 2(n-l). 

Clearly, this yields a polynomial algorithm for SAT: given an instance 
of SAT, we can first, if necessary, enlarge the number of variables to at 
least f — 2 without influencing satisfiability and then apply the above 
procedure. This contradicts the assumption that P / NP. 



Now we describe the construction of the automaton £/{ip) = {Q, U, §) . 
The state set Q of £/{tp) is the disjoint union of the three following sets: 

= {qij \ l<i<m+l, l<j<n+l, i^m+lorjj^n+l}, 
52 = {pi j I 1 < i < m + 1, 1 < j < 71 + 1}, 
= {^1,^0}. 

The size of Q is equal to 2(m + l)(n + 1) + 1 so a polynomial in m,n. 

The input alphabet i7 of ^ (tp) is the set {a, b, c} . In order to describe 
the transition function 5 : Q x U ^ Q , we need an auxiliary function 
/ : {a, 6} X {l,...,m} x {l,...,n} Q defined as follows. Let the 
variables involved in ip be xi, . . . ,Xn and the clauses of ip be ci, . . . , Cm • 
For a literal y G {xi, . . . , Xn, ~^xi, . . . , -^Xn} and a clause Cj , we write 
y G Cj to denote that y appears in q . Now set 



zq if d = a and xj € c, , 
zq if d = b and -^Xj G Cj 



^Qij+i otherwise. 

The transition function 6 is defined according to the following table: 



State q £ Q 


6{q,a) 


d{q,b) 


6{q,c) 


Qij for 1 < i < m,l < j < n 


f{a,i,j) 


fib,i,j) 


Qi,i 


Qm+ij for 1 < j < n 


Qm.+lJ+l 


Qm+l,j+l 


Qm+1,1 


Qi^n+i for 1 < i < m 


ZQ 


ZQ 


qm+1,1 


Pij for 1 < i < m + 1,1 < j < n 


PiJ+l 


Pi,j+l 


Pi,j + 1 


Pi^n+i for 1 < i < m + 1 


ZQ 


ZQ 


Qi,i 


Zl 


qrn+1,1 


qm+1,1 


ZQ 


Zq 


ZQ 


ZQ 


Zq 



Let us informally comment on the essence of the above definition. 
Its most important feature is that, if the literal Xj (respectively -^Xj) 
occurs in the clause Cj, then the letter a (respectively b) moves the state 
qij to the state zq . This encodes the situation when one can satisfy the 
clause Ci by choosing the value 1 (respectively 0) for the variable Xj. 
Otherwise, the letter a (respectively b) increases the second index of the 
state. This means that one cannot make q be true by letting Xj = 1 
(respectively Xj = 0), and the next variable has to be inspected. Of 
course, this encoding idea is not new, see, e.g., [2]. 



By the definition, zq is the zero state of the automaton . Since 

there is a path to zq from each state q £ Q, the automaton is 
synchronizing. 

Figure [T] shows two automata of the form si/ {ijj) build for the SAT 
instances 

■01 = {^l V 2:2 V X3, -iXi V X2, -'X2 V X3, -1X2 V -1X3}, 
^'2 = {xi V X2, -iXi V X2, -'X2 V X3, -1X2 V -1X3}. 

If at some state q £ Q the picture has no outgoing arrow labelled d £ U , 
the arrow q —>■ zq is assumed (all those arrows are omitted in the picture 
to improve readability). The two instances differ only in the first clause: in 
ipi it contains the variable X3 while in tp2 it does not. Correspondingly, the 
automata ^(■01 ) and £/{ip2) differ only by the outgoing arrow labelled a 
at the state ^1^3: in it leads to zq (and therefore, it is not shown) 

while in £/{^p2) it leads to the state 51^4 and is shown by the dashed line. 

Observe that ipi is satisfiable for the truth assignment xi = X2 = 0, 
X3 = 1 while ip2 is not satisfiable. It is not hard to check that the word 
cbbac synchronizes £/{ipi) and the word a^c is one of the shortest reset 
words for {1P2) ■ 

To complete the proof, it remains to show that minsynch {£/i^p)) = n+2 
if ip is satisfiable and mmsynchi-^ {''P)) > 2(n — 1) if is not satisfiable. 
First consider the case when ^p is satisfiable. Then there exists a truth 
assignment r : {xi, . . . , x„} — > {0, 1} such that Ci(r(xi), . . . , r(x„)) = 1 
for every clause Cj of ^p . We construct a word v = v{t) of length n as 
follows: 

I if T[Xj) = 0. 

We aim to prove that the word w = cvc is a synchronizing word for £/ (ip) , 
that is, Q.w = {zq} . Clearly, zi.c = zq. Further, S'^.cv = {zq} because 
every word of length n + 1 that does not end with c sends S'^ to zq . Now 
let T = {qi^i I 1 < i < m + 1} , so T is the "first row" of . Observe 
that S^.c = T . Since Ci(r(xi), . . . ^T{xn)) = 1 for every clause q , there 
exists an index j such that either Xj G c, and r(xj) = 1 or -iXj G q and 
r(xj) = 0. This readily implies (see the comment following the definition 
of the transition function of £/{ip)) that qi^i.v = zq for all 1 < i < m. 
On the other hand, qm+i,i-v = zi because every word of length n that 
does not involve c sends qm+1,1 to zi. Thus, S^.cv = T.v = and 
S^.w = {zq} . We have shown that w synchronizes £/(ip)^ and it is clear 
that \w\ = n + 2 as required. 



Now we consider the case when is not satisfiable. 

Lemma 1. // il^ is not satisfiable, then, for each word v € {a,b}* of 
length n, there exists i <m such that qi^n+i S T.v. 

Proof. Define a truth assignment r : {xi, . . . , x„} {0, 1} as follows: 



Since "0 is not satisfiable, we have Ci{T{xi), . . . , r(x„)) = for some clause 
Ci, 1 < i < m. According to our definition of the transition function 
of £/{ip), this means that qij-v\j] = Qij+i for all j = l,...,n. Hence 



Lemma 2. // i/j is not satisfiable, then for each word v € {a, b}* of 
length n and each letter d € U , the state qm+i,i belongs to T.vd. 

Proof. If d = c, the claim follows from Lemma [1] and the equalities 
Qm+i,! = qi,n+i-c that hold for all i < m. If d 7^ c, we observe that 
the state (?m+i,i is fixed by all words of length n + 1 not involving c. □ 

Let w' be a synchronizing word of minimal length for .s/{tp) and 
denote w = cw'c. Then the word w is also synchronizing and £ = \w\ > n 
because already the length of the shortest path from Qm+i,! to zq is 
equal to n + 1. Let k be the rightmost position of the letter c in the word 
u;[l..n] . 

Lemma 3. TCQ.w[l..k]. 

Proof. Indeed, since A; < n, for each 1 < i < m + 1 we have 



We denote by v the longest prefix of the word w[k + such that 

V S {a,b}* and \v\ < n. Since w ends with c, the word v cannot be a 
suffix of w . Let d £ E he the letter that follows v in w . If \v\ = n, then 
Lemma [2] implies that qm+i,i £ T.vd. If \v\ < n, then by the definition of 

V we have d = c. Hence 



Thus, qm+1,1 € T.vd also in this case. Combining this with Lemma [3], we 
have 




qi,n+i = qi,i-v G T.v. 



□ 



Pi,„+2-fc-tf - l]w[k] = Pi,n+i-c = qi,i G T. 



n 



qm+l,l-Vd — qm+l,\v\+l-C — Qm+1,1- 



Q.w[l..k]vd 5 T.vd B qm+i,i- 



(2) 



Prom the definitions of k and v it readily follows that w[k + l..n] is a 
prefix of v whence \v\ > n — k. Thus, |t(;[l..A;]?;d| > k+{n — k) + l = n + 
Recall that the length of the shortest path from qm+i,i to zq is equal to 
n + 1, and the suffix of w following w[l..k]vd must bring the state qm+i,i 
to zq in view of ([2]). Hence \w\ > (n + 1) + (n + 1) = 2?i + 2 > 2n and 
\w'\ > 2(n — 1) . We have proved that mmsynchi-^ (4')) > 2(n — 1) if is 
not satisfiable. □ 



2 The main result 

The main result of this paper is 

Theorem 2. If V ^ NP, then no polynomial algorithm can approximate 
the minimal length of synchronizing words within a constant factor in the 
class of all synchronizing automata with 3 input letters. 

Proof. Again we fix an arbitrary n > 2 and take an arbitrary instance 
•0 of SAT with n variables. We shall prove by induction that for every 
r = 2, 3, . . . there exists a synchronizing automaton s^rii^) = {Qr-, ^r) 
with the following properties: 

- E = {a,b,c}; 

- \Qr\ is bounded by a polynomial of n and the number m of clauses 
of -0; 

- if -0 is satisfiable under a truth assignment r : {xi, . . . , Xn} —>■ {0, 1} , 
then the word w = c^~^v{t)c of length n + r synchronizes ^(V') 
(see for the definition of the word v{t)); 

- mmgynchi-^r) > r{n — 1) if ■0 is not satisfiable. 

Then, applying the same standard argument as in the proof of Theorem [H 
we conclude that for no e > the minimal length of synchronizing words 
can be approximated by a polynomial algorithm within factor r — e. Since 
r can be arbitrarily large, the statement of the main result follows. 

The induction basis is verified in the proof of Theorem[T] we can choose 
the synchronizing automaton ^/^ip) to play the role of ^("0). For the 
sake of uniformity, in the sequel we refer to the state set Q of £/ (-0) and 
its transition function 6 as to Q2 and respectively 62 • 

Now suppose that r > 2 and the automaton i24--i(0) = {Qr-i, ^r-i) 
with the desired properties has already been constructed. We let 



Qr = Qr-l [jiQ2 \ Uo}) X Qr-l- 



Clearly, |Qr| = |Qr-i| • IQ2I and from the induction assumption it follows 
that \Qr\ is a polynomial in m, n. 

We now define the transition function 6r '■ Qr x ^ ^ Qr ■ Let d G i7 , 

q £ Qr ■ q & Qr~i , then we set 

6riq,d) = 6r-iiq,d). (3) 
U q = {q', q") G {Q2 \ {zq}) x Q^-i , we define 



6r{q,d) 



zq if S2{q',d) = zo, 



if 52{q',d) = grn.+i,i and either 
q' = qi,n+i for i G {1, . . . , m} 
or gr' = qm+i,j for j G {2, . . . , n} 
or q' = zi, 
{62{q',d),q") in all other cases. 

Using this definition and the induction assumption, one can easily verify 
that the state zq is the zero state of the automaton £/r{'>P) and that there 
is a path to zq from every state in Qr- Thus, ^rii^) is a synchronizing 
automaton. 

In order to improve readability, we denote the subset {qij} x Qr-i by 
Qij for each state qij G 5^ and the subset {pij} x Qr-i by Pij for each 
state Pi J G S'^ . Slightly abusing notation, we denote by T the "first row" 
of 5^ X Qr-i, i.e. T = Ui<i<m+i ■ Similarly, let P = Ui<i<m+i -Pi.i 
be the "first row" of 5^ x Qr-i ■ We also specify that the dot-notation (like 
q.d) always refers to the function 6r. 

First we aim to show that if ip is satisfiable under a truth assignment 
r : {xi, . . . ,Xn} {0,1}, then the word w = c^~^7;(t)c synchronizes 
the automaton .s/rii^) ■ By ^ and the induction assumption we have 
Qr-i-c ^ Qr-i and Qr._i.c^~^f (r)c = zq. Further, we can decompose 
(('32 \ {zq}) X Qr-i).c as {zq} U Fr-i U Fr for some sets Fr-i Q Qr-i 
and Fr C (Q2 \ {zq}) x Qr-i ■ By the induction assumption, 

Fr_i.c''~^f (r)c C Qr-i.c'~'^v{T)c = Zq 

Consider the set F^ . Using the definition of the action of c on Q2 via 82 , 
one can observe that Fr = TuG where G stands for 5^ x Qr~i\P . From 
g| we see that T.c = T and G.c C T U G. Thus we have F,..c''~2^(r)c C 
T.v{t)c U G.v{t)c, and combining the first alternative in ^ with prop- 
erties of the automaton ^ (■;/') established in the proof of Theorem [U we 
obtain T.v{t)c = G.v{t)c = {zq} . 



Now we consider the case when tp is not satisfiable. The following 
lemma is parallel to Lemma [T] and has the same proof because the action 
of a and b on the "blocks" Qij with 1 < i < m and 1 < i < n via 5^ 
precisely imitates the action of a and b on the states qij in the automaton 
see the last alternative in 

Lemma 4. // il) is not satisfiable, then, for each word v € {a, b}* of 
length n, there exists i < m such that Qi^n+i ^ ^r{T,v). □ 

In contrast, the next lemma which is a counterpart of Lemma [2] uses 
the fact that in some cases the action of the letters via 6r drops states 
from ((Q2 \ zo) X Qr~i) down to Qr-i, see the middle alternative in 

Lemma 5. // ip is not satisfiable, then for each word v € {a, b}* of length 
n and each letter d G S , we have Qr-i C 6r{T,vd) . 

Proof. If d = c, the claim follows from Lemma |4] and the equalities 
5r{{<li,n+i,(l"),c) = q" that hold for al\ i < m and all q" G Qr-i- If 
d 7^ c, we observe that 6r{{qm+i,ii q")iv) = {zi,q") and 6r{{zi,q"),a) = 
6r{{zi , q"), b) = q" for all q" £ Qr^i . □ 

Let w' be a synchronizing word of minimal length for ^•(V') and 
denote w = cw'c. Then the word w is also synchronizing and £ = \w\ > 
(r — l)n by the induction assumption. Let k be the rightmost position 
of the letter c in the word i(;[l..n]. We have the next lemma parallel to 
Lemma [3] and having the same proof (with the "blocks" Pij with 1 < i < 
m + 1, n + 2 — k<j<n playing the role of the states Pij)- 

Lemma 6. T C 5r{Qr,w[l..k]) . □ 

Now, as in the proof of Theorem [U we denote by v the longest prefix 
of the word w[k + such that v G {ci,b}* and \v\ < n. Clearly, v 
cannot be a suffix of w . Let d £ E he the letter that follows u in it; . If 
|u| = n then Lemma [5] implies that Qr-i ^ 5r{T,vd). If \v\ < n, then by 
the definition of v we have d = c. Hence 

SriQm+l,l,vd) = 5r{Qm+l,\v\+l,c) = Qr—1- 

Thus, Qr-i Q 6r{T,vd) also in this case. Combining this with Lemma [U 
we have 

6r{Qr,w[l..k]vd) 5 5r{T,vd) 5 Qr—1- (5) 

Prom the definitions of k and v it readily follows that |f| > n — k. 
Thus, |it;[l..fc]v(i| >k + {n — k) + l = n + l. The suffix of w following 



w[l..k]vd must bring the set Qr-i to a single state in view of ([5|). However, 
by ([3]) the restriction of 6r to Qr-i coincides with 6r-i whence the suffix 
must be a synchronizing word for . By the induction assumption 

mmsynchi-^r-iiip)) > {r — l)(n — 1), and therefore, 

\w\ > (n + 1) + (r - l)(n - 1) = r(n - 1) + 2 

and \w'\ > r(n — 1) . We have thus proved that mmsynch{'^ri'>P)) > r(n — 1) 
if ^p is not satisfiable. This completes the induction step. □ 



3 The case of 2-letter alphabets 

We show that the main result extends to synchronizing automata with 
only 2 input letters. 

Corollary 1. // P 7^ NP, then no polynomial algorithm can approximate 
the minimal length of synchronizing words within a constant factor in the 
class of all synchronizing automata with 2 input letters. 

Proof. For any synchronizing automaton £/ = (Q, {oi, 02, 03}, 5) we can 
construct a synchronizing automaton ^ = {Q', {a, b}, 5') such that 

mmsynchi-s^) < "omisynchi'^) < Sminsynchi-s^) (6) 

and \Q'\ is a polynomial of \Q\. Then any polynomial algorithm approxi- 
mating the minimal length of synchronizing words for 2-letter synchroniz- 
ing automata within factor r would give rise to a polynomial algorithm 
approximating the minimal length of synchronizing words for 3-letter syn- 
chronizing automata within factor 3r . This would contradict Theorem [2l 
We let Q' = Q x {01,02,03} and define the transition function 6' : 
Q' X {a, 5} Q' as follows: 

6'{{q,ai),a) = (g, amm{j+i,3)), 
6'{{q,ai),b) = ((5(g, Oj), ai). 

Thus, the action of a on a state q' £ Q' substitutes an appropriate letter 
from in the alphabet {01,02)03} of for the second component of q' 
while the action of b imitates the action of the second component of q' on 
its first component and resets the second component to oi . Now is let a 
word w € {oi, 02, 03} of length £ be a synchronizing word for £/ . Define 

b if w[s] = oi, 
ab if w[,s] = 02, 
aab if w[,s] = 03. 



Then the word v = bvi ■ ■ ■ V£ is easily seen to be a synchronizing word for 
^ and \v\ < 3£ unless all letters in w are 03 but in this case we can just 
let 02 and 03 swap their names. Hence the second inequality in ([6]) holds 
true, and the first inequality is clear. 
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